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Abstract 

The history and the major results of the study of the topology of 
the large-scale structure are briefly reviewed. Two techniques based on 
percolation theory and the genus curve are discussed. The preliminary 
results of the percolation analysis of the Wiener reconstruction of the 
IRAS l.2Jy redshift catalog are reported. 


1 Introduction 

Looking at the large-scale distributions of galaxies, one may notice that along 
with clusters and groups of galaxies there are also conspicuously oblong con¬ 
centrations of galaxies: hlaments. One may also get an impression that 
hlaments are connected to each other forming a single network spanning 
through the entire sample. The two-point correlation function (the most 
common way of describing the distribution of galaxies) obviously is not sen¬ 
sitive to the geometry and topology of the galaxy distribution. The three- 
and many- point correlation functions generally speaking are sensitive to 
the shapes and probably the topology but become cumbersome very quickly. 
Currently popular averaged moments are easy to interpret, but they loose 
sensitivity to the geometry after averaging over the volume |l|. There have 
been suggested various statistics to characterize the geometry and topology 
of the large-scale structure. In this talk I briefly review the studies of the 
topology of the large-scale galaxy distribution. 

The hrst mention of topology in the context of the large-scale structure 
problem (I am familiar with) was in the 1970 paper by Doroshkevich [^]. 
Studying the formation of the large-scale structure in the pancake scenario 
Doroshkevich calculated the Euler characteristic of the isodensity surfaces 
of the initial Gaussian density held. Both the Euler characteristic and a 
common topological measure used in cosmology, genus, are determined by 
the mean Gaussian curvature of the surface of a constant density. 

In 1982 Zel’dovich noticed that the percolation properties of the nonlinear 
density distribution in the HDM (Hot Dark Matter) model are very different 
from the initial Gaussian held. He also suggested characterizing the topol¬ 
ogy of the nonlinear density distribution by the percolation thresholds [^]. 
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Percolation theory deals with the number and properties of the “clusters”, 
which are dehned as connected regions bounded by the surfaces of a con¬ 
stant density. Following Zel’dovich’s idea the author of this talk suggested 
to use percolation properties of the galaxy distribution as an objective quan¬ 
titative measure of the topology of the large-scale structure and also as a 
discriminator between cosmological models [^], p[. 

The percolation technique was utilized in the study of the CfA I catalog 
[|]. It was found that the large-scale distribution of galaxies had a network 
structure. Theoretical studies of the models with the power law initial spectra 
showed that the n = — 1 model clearly percolated better than the n = 0 model 
and also in the = 1 universe the n = —1 model was in an agreement with 
the observations [0. The percolation method showed that the CDM (Cold 
Dark Matter) model appeared hlamentary rather than hierarchical |P], [^j. 
It was also pointed out that the major disadvantage of percolation technique 
was the dependence of the percolation thresholds on the mean density of the 
sample m which made it difficult to apply to sparse samples. Similarly, we 
note that at present some believe that sparse samples can be reliably used 
for the estimation of the two-point correlation function only |TI[ . 


In 1986 Bardeen et al and Gott, Melott and Dickinson rediscovered 
Doroshkevich’s idea of utilizing the Euler characteristic and expanded it to 
the nonlinear distributions as well as galaxy catalogs |^. (Both percola¬ 
tion and genus techniques assumes some kind of smoothing when applied 
to galaxy distributions.) However, instead of the mean density of the Euler 
characteristic x used by Doroshkevich they introduced the mean density of 
genus, g, which is proportional to the Euler characteristic: g = —x/2. (For a 
general review of this method see e.g. 


Tomita gave a very elegant 
analytic expression for the mean Euler characteristic for a D-dimensional 
Gaussian held which in three-dimensional space yields the familiar equation 
for the mean genus density 
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where u = 6/as is the number of standard deviations by which the 
threshold density departs from the mean density, 6 = {p — p)/p, < >= 

/ k‘^P{k)d?k/ J P{k)d^k, and P{k) is the power spectrum (see e.g. |^). In a 
Gaussian held the genus curve has a maximum at z/ = 0 with the amplitude 
determined by the characteristic scale of the density held < k^ >. Since it 
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depends on the slope of the spectrum it is often used as a measure of the 
“effective” slope of the spectrum (see e.g. 0 ); however, it is worth mention¬ 
ing that < > has a stronger dependence on the smoothing scale than on 

the spectral index. Therefore even weak nonlinearity on the smoothing scale 
may influence the estimate of the spectral index. The genus curve changes 
sign two times at z/ = ±1 which signihes the qualitative change of the topol¬ 
ogy of the surface separating high (5 > 5^) and low (5 < 5c) density regions 
where 5c is a chosen density threshold. As suggested by Eq.| there are only 
two types of qualitatively different topologies: 1) one phase percolates and 
the other does not (negative genus) and 2) both phases percolate (positive 
genus). In the range — 1 < z/ < 1 both phases percolate through the whole 
region and this is often reffered to as sponge topology. At z/ < 1 the low 
density regions do not percolate and at z/ > 1 the high density regions do 
not percolate. The topology of the separating surfaces is obviously the same 
sX V = XzVc as measured by Eq.^. But in cosmological literature it is labeled 
either as a meatball or bubble topology depending on whether the high or 
low density phase does not percolate. The network structure obviously has a 
sponge topology however the term emphasizes a geometrical aspect of a non- 
Gaussian density held: the high density regions at the percolation threshold 
occupy a smaller volume than that of a Gaussian held. 

The change of the genus sign is believed to coincide with the percola¬ 
tion thresholds however there is no theorem proving that. Intuitively, it is 
plausible for distributions resulting from Gaussian helds due to gravitational 
instability if one believes in a common interpretation of the genus as the mean 
density of the number of holes minus the number of the isolated regions and 
that clusters with holes inside do not form. Under these conditions, we as¬ 
sume that percolation thresholds coincide with the changes of the genus sign 
at least approximately. 

One advantage of the genus method is the existence of the analytic ex¬ 
pression for Gaussian random helds (Eq.|^. Recently there has also been an 
analytic expression obtained in the weakly nonlinear regime [^]. However, 


one should not forget that the mean genus is a statistical measure and there¬ 
fore an estimate of errors is needed before it becomes meaningful. The errors 
for hnite samples having hnite resolution can be estimated only from numer¬ 
ical simulations. Percolation parameters are also calculated numerically, but 
if one can estimate the errors he almost certainly can estimate the mean with 
similar accuracy. 
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It has been claimed that the percolation thresholds are the most sensitive 
discriminators of the models |^. The recent study of the CfA II catalog using 
the genus method seems to support that suggestion. The authors reduced 
the information of the genus curve to three numbers one of which was the 
genus peak width MA = i^+ — i'- where z/_|_ and z/_ are the levels at which the 
genus changes the sign. Fig. 12 through 14 in [|T^] clearly demonstrate that 
Wi, has the highest discriminating power. However, we still believe that the 
percolation thresholds, and z/_, should be interpreted separately because 
they carry independent information about the topology of the structure. 


2 Largest “cluster” and largest “void” 

Percolation theory deals with the number and properties of the “clusters”. In 
the absence of a better term we label as “clusters” the regions bounded by the 
surfaces of chosen constant density. In order to avoid confusion with clusters 
of galaxies, we will use quotation marks when talking about “clusters” with 
this non-astronomical meaning. The density threshold Sc separating high 
(5 > Sc) and low {S < Sc) density regions is assumed to be a free parameter 
Sc > —1. Analyzing discrete distributions (e.g. galaxy distributions) we 
assume a smoothing procedure creating a continuous density distribution. 

At every density threshold all “clusters” and “voids” are identihed and 
various types of analysis can be performed @]. However here we present 
only the results of the study of the largest “cluster” and the largest “void” 
as functions of a density threshold. (The largest “void” is dehned as the 
largest “cluster” in the low density phase.) The full analysis will be presented 
elsewhere [^, The choice of the largest structure is determined by the 
fact that at the percolation threshold the largest structure become inhnite 
which signihes the change of topology. 

The density threshold is not a convenient parameter if linear (Gaussian) 
and nonlinear density distributions are to be compared. Instead we utilize 
the hlling factor to parameterize the density threshold [|^]. The hlling factor 
is the fraction of the volume occupied by the given phase. In this case one 
can easily compare the properties of “clusters” with that of “voids” and 
also linear and nonlinear density distributions. It is similar to comparing 
different patterns provided that the same amount of paint was used to make 
each pattern. The hlling factor as a function of the density threshold is 
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obviously the cumulative distribution function. 

The largest “cluster” and “void” are measured as a fraction of the cor¬ 
responding hlling factor. Thus if the largest “cluster” is 0.9 at hlling factor 
of 0.2 it means that the density is higher than the chosen threshold in 20% 
of the volume and almost all of that volume (90%) comprised of only one 
connected region. 

The top two panels of Fig.l show the largest “cluster” and the largest 
“void” for the density distributions obtained in the N-body simulation of 
the power law model with n = — 1 at two stages of evolution: \ni = 1/8 
and l/ALbox- The simulations have been done with 128^ particles on the 
equivalent mesh but for this analysis the mesh has been reduced to 


64^. Error bars show Icr deviations from the mean obtained in four different 
realizations of the model. 

The qualitative behavior of both of the largest structures is universal: 
at small hlling factors the largest structure is negligible then at some hlling 
factor it quickly grows and becomes the only signihcant structure in the cor¬ 
responding phase. This is the percolation transition and also the indication 
of the change of topology. In Gaussian helds there is no statistical diherence 
between “clusters” and “voids” and the transition happens at a hlling factor 
of about 16% corresponding to ly = ±1. However, a hnite size of the sam¬ 
ple as well as hnite resolution biases the transition. In order to avoid these 
ehects, we obtain the “Gaussian” distribution by mixing the phases of the 
Fourier transform of the nonlinear density distributions in question. This 
automatically includes all hnite grid ehects in the reference Gaussian held 
keeping the Fourier amplitudes exactly the same. This allows for the genera¬ 
tion of as many Gaussian realizations with identical amplitudes as needed to 
estimate the dispersion. The Gaussian largest structure is shown as a dotted 
line in Fig.l (hidden by the shade of the error bars) lying between the solid 
and dashed lines. 

The major feature of the nonlinear distribution is that the largest “clus¬ 
ter” percolates easier and the largest “void” harder than in the Gaussian case. 
The signihcance of this conclusion for the largest cluster is at the many-cr 
level (see Fig.l). Qualitatively this remains true for all models we have stud¬ 
ied (n = 1,0, —1, —2, —3, GDM, and G-I-HDM @]), but quantitatively the 
transitions are different. The high density regions form a connected net¬ 
work spanning through the whole region when the hlling factor is relatively 
small (smaller than in the Gaussian case) and therefore this transition can 
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be labeled as a shift toward the network structure. On the contrary the low 
density regions do not form a percolating void (remain isolated) even when 
the hlling factor of the low density phase is greater than that in the Gaus¬ 
sian held. This type of transition can be labeled as a shift toward the bubble 
structure. The range of the sponge topology is typically (but not necessarily) 
increased compared to the Gaussian case. Thus the above changes also can be 
labeled as a shift to a sponge topology. However, the major point is not how 
to label a structure but rather to show that in a general case the two shifts 
are independent of each other and carry independent information about the 
structure. Therefore combining them into one parameter (like Wy = z/+ — 
mentioned above) results in lost information. 

At small hlling factors the largest structure must be negligible in suffi¬ 
ciently large samples. The actual (hnite) sizes of the largest structures can 
be used as an internal characteristic of the fairness of the sample. 

In the past the percolation technique has been successfully applied to 
volume limited samples |P]. Analyzing the statistically homogeneous distri¬ 
butions is very easy and the largest structures clearly distinguish between the 


models 


However, the analysis is much more difficult if the distribution 


has a radial gradients, like the IRAS 1.2Jy redshift catalog. 


3 The IRAS 1.2Jy catalog 

We analyze the whole-sky galaxy distribution in real space reconstructed 
from the redshift IRAS 1.2Jy catalog using a Wiener hlter and an expansion 
in spherical harmonics . The Wiener hlter effectively uses a variable win¬ 
dow size which is about 500km/s at 2000km/s and increases to 1800km/s 
at lOOOOfcm/s. The resulting smoothed galaxy distribution is not statisti¬ 
cally homogeneous. This is the major challenge for applying the percolation 
technique. 

In order to test the effect of the reconstruction we generated two density 
distributions from the N-body simulation {n = —1, kni = 8): one with the 
Wiener hitering and the other without it. The percolation statistics of these 
distributions are shown in the middle and bottom panels of Fig.l. The panels 
on the left hand side show the largest structures in the reconstructed density 
held without the Wiener hlter and on the right hand side with the Wiener 
hlter applied. The panels in the middle row show the results for the sphere 
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Figure 1: The N-body simulations of the n = —1 model. The top row show 
two stages of evolution \ni = 1/8 and l/AL^ox before reconstruction. The 
largest “cluster” (solid line) and largest “void” (dashed line) is shown as a 
function of Filing factor; the Gaussian realization with the identical Fourier 
amplitudes is shown in between (shaded by the error bars). The middle and 
bottom panels are the reconstructions without the Wiener hlter (on the left 
hand side) and with the Wiener filter (on the right hand side). Two different 
radii of the sphere has been used: i? = 30 and 24 mesh units. 

Figure 2: The Wiener reconstruction of the IRAS 1.2Jy redshift catalog in 
spherical harmonics. Solid lines show the largest “cluster” and dashed lines 
show the largest “void” at two values of P = and at two radii of the 

spherical region. 


with the radius of 30 mesh units and the bottom panels show the results for 
smaller sphere with the radius of 24 mesh units. 

The middle and bottom panels show only one realization which demon¬ 
strates a substantial change in the topological properties due to the recon¬ 
struction procedure. We plot here only one realization for better visual com¬ 
parison with the data. Fig.2 shows the largest structures in the hltered galaxy 
distribution assuming two different P = /h. This preliminary result is in 

a very general agreement with the n = —1 model. The models with P = 0.1 
and /5 = 1 do not look much different. Unfortunately the method of estimat¬ 
ing the dispersion described above does not work in the inhomogeneous case. 
We are working now on the improved version of estimating the dispersion. 

The largest structures in the galaxy distribution are quite large at small 
Filing factors: 30 — 40% of the corresponding Filing factor. It may mean that 
the sample is not large enogh for this kind of analysis. 

4 Summary 

The quantitative topology proved to be a useful technique for studying the 
large-scale structure in the universe. There have been suggested two diFerent 
methods for quantitative characterization of the topology. One widely used 
method measures the mean density of genus of a constant density surface as 
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a function of the density threshold. It actually measures the mean Gaussian 
curvature of the surface. The other more geometrical method is based on 
percolation theory. Here the number and properties of the “clusters” are 
studied. In particular the volume of the largest “cluster” measured as a 
function of a density threshold plays an important role. 

In general the questions addressed by the two methods are similar but not 
identical. In the case where the questions coincide the two approaches use 
different methods of solving them. In particular, they treat the boundaries 
differently and are probably affected differently by noise. 

The genus curve method is more developed and has been applied to many 
catalogs of galaxies (both two-dimensional and redshift surveys). The major 
results indicate that. non-Gaussian behavior has been detected in both types 
of the catalogs (see e.g. |2^, |jT^, [|^ and references therein). The charac¬ 


terization of the structure is somewhat conflicting. Almost all possible labels 
have been assigned to the galaxy distributions: meatball, sponge, network, 
bubble topology. However, it is likely that the structure seen in different 
catalogs looks different. The estimate of effective slope is in agreement with 
n = -1 H. 


The percolation method has been tested in various theoretical models and 
developed to the level when it can be applied to the galaxy catalogs. For the 
first time we try to apply it to the magnitude limited sample and hnd the 
preliminary result encouraging. 

I am grateful to Gapp Yess and Karl Fisher for allowing me to report the 
preliminary results of a common unhnished work and Francis Bernardeau for 
useful discussions during the meeting. I acknowledge the AAS travel grant, 
NSF grant AST-9021414, and University of Kansas GRF-94 grant. 
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